On Yahoo Answers, Long Answers are Best

Authors

  • Alina Beygelzimer
  • Ruggiero Cavallo
Abstract

We provide an analysis of best answers (as chosen by questioners) on Yahoo Answers, a popular online Q&A site with millions of monthly contributors. Our analysis is done mainly through the lens of prediction: we compile a dataset that is as large and fine-grained as any considered before, generate features across a range of different classes, and build a classifier to predict which answers will be selected as “best”. On the dataset as a whole, despite the breadth and sophistication of our features and learning framework, we achieve virtually no performance edge over the following simple baseline: choose the longest answer. Propelled by this unexpected discovery, we perform a detailed analysis of answer length and how it relates to other variables of interest, such as answer time and number of answers. We explore subsets of the data designed to probe into areas where the longest-answer baseline may be handicapped, but we consistently find that it is competitive with our full-featured learner. Our results suggest future directions of study, e.g., controlled experimentation or user interviews, which may shed further light on why answer length is such a good proxy for (the questioner’s estimation of) answer quality.
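As a rough illustration of the baseline named in the abstract ("choose the longest answer"), the following minimal sketch scores that rule on a few questions. It is not the authors' code: the function names, the record layout (`answers`, `best`), the toy data, and the choice to measure length in whitespace-separated words are all assumptions made for this example.

```python
from typing import Dict, List, Sequence


def longest_answer_index(answers: Sequence[str]) -> int:
    """Return the index of the longest answer.

    Assumption: length is the number of whitespace-separated words.
    Ties go to the earliest answer, since max() keeps the first maximum.
    """
    if not answers:
        raise ValueError("need at least one answer")
    return max(range(len(answers)), key=lambda i: len(answers[i].split()))


def baseline_accuracy(questions: List[Dict]) -> float:
    """Fraction of questions where the longest answer is the chosen best one."""
    hits = sum(
        1 for q in questions
        if longest_answer_index(q["answers"]) == q["best"]
    )
    return hits / len(questions)


# Made-up toy data (hypothetical, not from the paper's dataset).
# "best" is the index of the answer the questioner selected.
questions = [
    {"answers": ["Yes.", "It depends on the model, but usually yes."], "best": 1},
    {"answers": ["Try restarting the router first.", "No idea"], "best": 0},
    {"answers": ["Short but right", "A much longer reply that was not picked"], "best": 0},
]

accuracy = baseline_accuracy(questions)
```

A learned classifier would be evaluated against the same `best` labels, so the two accuracies can be compared directly.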


Similar resources

Analyzing Users’ Health Information Needs Based on the Yahoo Answers®

Background and Aim: People refer to virtual information resources for answering their medical questions. One of these resources includes question and answering (Q&A) sites in medicine. This study aims to analyze health questions posted on the Yahoo Answers to identify health information needs, the motivations for asking questions, evaluation of information user satisfaction resulted from recei...


An Entity-Based approach to Answering Recurrent and Non-Recurrent Questions with Past Answers

Community question answering (CQA) systems such as Yahoo! Answers allow registered-users to ask and answer questions in various question categories. However, a significant percentage of asked questions in Yahoo! Answers are unanswered. In this paper, we propose to reduce this percentage by reusing answers to past resolved questions from the site. Specifically, we propose to satisfy unanswered q...


SPAN: Understanding a Question with Its Support Answers

Matching a question to its best answer is a common task in community question answering. In this paper, we focus on the non-factoid questions and aim to pick out the best answer from its candidate answers. Most of the existing deep models directly measure the similarity between question and answer by their individual sentence embeddings. In order to tackle the problem of the information lack in...


Ranking Answers and Web Passages for Non-factoid Question Answering: Emory University at TREC LiveQA

This paper describes a question answering system built by a team from Emory University to participate in TREC LiveQA’15 shared task. The goal of this task was to automatically answer questions posted to Yahoo! Answers community question answering website in real-time. My system combines candidates extracted from answers to similar questions previously posted to Yahoo! Answers and web passages f...


Automatic Identification of Best Answers in Online Enquiry Communities

Online communities are prime sources of information. The Web is rich with forums and Question Answering (Q&A) communities where people go to seek answers to all kinds of questions. Most systems employ manual answer-rating procedures to encourage people to provide quality answers and to help users locate the best answers in a given thread. However, in the datasets we collected from three online ...




Publication date: 2015